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This invention relates the enhancements of neural network- 
assisted reservoir characterization techniques for geological 
classification . from measured input data. 

According to the present invention, the terms "measured input 
data" or "INPUT DATA" refers to, in particular, downhole logs. 
The set of logs used in the testing of the method of the 
invention includes gamma ray (GR) , sonic slowness (DT) , thermal 
neutron porosity (NPHI) , bulk density (RHOB) and true 
resistivity (RT) , all measured at same depth for each sample, 
and at a constant sampling distance. However, INPUT DATA are not 
restricted to samples at a single depth. Alternatively, 
attributes that represent, for example, sliding window averages 
or other statistics taken over a depth range in the neighborhood 
of the depth of interest, can be constructed. 2D image logs 
(e.g., FMI) or 3D seismic cubes are also encompassed. 

According to the present invention, the terms "geological 
classes" or "CLASSES" refers to, principally, the rock facies 
(lithofacies) or the reservoir rock types. However, any other 
discrete classification of geological features (e.g. 
petrophysical properties) is possible. 



Rock facies class prediction by neural network processors 
applied to downhole logs is an existing method developed in the 
nineteen nineties which gave rise to several publications [1] . 
For instance, it has been implemented by an ENI AGIP E&P team, 
and integrated in a joint development project into the product 
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RockCell™ within the Schlumberger™ GeoFrame™ oilfield 

interpretation software platform. 

For rock facies estimation, a set of single- channel log curves 
are selected. Typical logs used are gamma ray (GR) , sonic 
slowness (DT) , thermal neutron porosity (NPHI) , and bulk density 
(RHOB) , but this list is not limited. New attributes can also be 
generated from existing logs in order to reveal additional 
features in the logs. 

A current limitation in analyzing geological measured data such 
as downhole logs, is that their relationship to classes such as 
rock facies is not obvious. In each borehole, there are unknown 
local factors that may affect the data in unexpected ways. It 
can thus be risky to classify on a simplified theoretical 
analysis or by data clustering. There is a need for a method to 
identify associations between input data and to build implicit 
complex functional relationships. A "learn .from examples" method 
is more preferred to building an expert system. The discovered 
methods would then be used to predict .the classes and their 
associated probabilities. 

An Artificial Neural Network (ANN) scheme has been developed to 
implement learning by example as applied to downhole geological 
classification. Neural networks can "learn" specific computation 
schemes. Once trained, a neural network can find acceptable 
solutions on any set of data referring to the . learned schemes. 
This gives artificial neural networks an ability to generalize 
from training experience (see [12]). Unlike analytical 
approaches such as statistics, neural networks require no 
explicit computational model, and are not limited by a lack of 
normality or the non-linearity of the physical phenomenon. As a 
consequence, they "learn" relationships between data that may be 
hard to discover with analytical methods. 
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The behavior of a neural network is defined by its architecture. 
This architecture consists of the way its neurons (individual 
computing elements) are connected and by strength (weight) of 
5 those connections. Each neuron performs a weighted sum (linear 
combination) of its inputs, then applies an almost non-linear 
activation function, to finally produce an output. The resulting 
output of a given neural layer is forwarded to the next layer 
and so on through the network. In other words, neural networks 
10 plainly perform a massively parallel set of elementary 
computations. Whereas the weights vary the strength of. 
connections from one node to another, the sigmoidal activation 
function provides the highly non- linear property of neural data 
processing. 

15 

The main advantage of those neural nets is their learning 
capability. During the learning phase, v given a training set of 
'data,, the interconnection weights are gradually adjusted so as 
to stabilize the network's output, and, in the case of the 
2 0 supervised learning, to minimize the mean square error between 
the effective output and the desired one. The preferred 
implementation of the NN is a supervised feed- forward, multi- 
layer perceptrons trained with the back-propagation algorithm. . 

» 

2 5 Methods and techniques used today are able to classify without 
the a priori knowledge of classes sequencing. The prediction 
operates on geological input data sample-by- sample, and produces 
for each input pattern the probabilities of the most likely 
classes. 

30 

However, this system sometimes fails in its predictions. One of 
its main limitations is that it does not honour geological prior 
knowledge. Some of the predictions fail due to the fact that 
geologically improbable classes transitions are often observed. 
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Sedimentologists have observed that the vertical and lateral 
sequence of geological facies 1 seen in outcrop and in the 
subsurface, are not random. Since the stratigraphic layering in 
5 the earth represents successive time of deposition, the . rock 
record actually represents a time series of events. Since the 
normal neural net techniques make sample-by- sample predictions, 
they do not consider previous states of prediction (e.g., the 
facies predicted at location X n _i, which implies t n -i, constrains 

10 the prediction at location X) and they fail to take advantage of 
likely non-random transitions between lithology or facies. 
Geology can . provide strong constraints on the prediction of 
stratigraphic successions. Sedimentologists have long invoked 
Markov models for analyzing the vertical and lateral sequences 

15 [2, 3, 4, 5]. Therefore, using a Markov scheme using geological 
prior information of rock facies transition probabilities seems 
a fruitful way to improve the prediction of the neural network 
scheme. 

20 Systems for speech recognition, integrating a neural network and 
a Hidden Markov Model (HMM) , are known from the state of the 
art. HMMs are used as a major approach in the majority of 
continuous speech recognition systems. They provide, an accurate 
and reliable framework for segmentation and classification of 

25 speech. HMM states can stand for the phone classes, c± (e.g., 

phonemes) to be identified, whereas the HMM observation sequence 
for the acoustic vectors y (e.g., a combination of cepstral and 
energy acoustic • parameters) . As a consequence, the state 
sequence X = x^, . • - / xrp of length T can be considered as 

30 the. "sentence" to be recognized due to the recorded and 
discretized acoustic observation sequence Y = y%, y2, yp- 
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Facies sequences have been considered as analogous to the 
phoneme sequences in the speech recognition methods. The HMM and 
its stochastic behavior represent the allowed or forbidden 
5 transitions between geological classes and their associated 
probabilities, and the geological input data are analogous to 
the acoustic observation vectors used during the speech 
recognition process . 

10 The HMM technology has already been applied to lithofacies 
classification from well logs. Publications [6], [7] and [8] 
describe the building, training and application of a Hidden 
Markov Model to estimate the lithology of uncored boreholes 
based on key learning data sets- where the lithology is known. In 

15 those methods, the lithofacies sequence stands for the 
consecutive states of the HMM, and the log data for the 
observations . Those methods do not rely on the use of a neural 
network. This means they are able to model the stochastic 
character of rock facies transitions and the rock facies 

20 sequences. However, they perform poorly while modeling the non- 
linear relationship between logs and rock facies, as they do not 
benefit from the complex neural network architectures and 
computation schemes . 

25 In the papers [9] and [10], and in several patents concerning 
speech recognition, such as [11] , an interesting approach to 
classify speech phonemes has been developed by the use of hybrid 
models . mixing both HMM and ANN. Those approaches enable speech 
recognition systems to cope with the strong statistical 

3 0 assumptions of the HMMs . 

Applying a feed- forward neural network to the input data y can 
give us estimates of the conditional posterior probabilities 
£>(x±Iy) of each class x^, given the current input vector y. 
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Those class-conditional* posterior probabilities must sum to one, 
and therefore need to be normalized. However, a HMM needs the 
conditional prior probabilities p(y/x£). Assuming there are 

enough training data and that the training does not get held up 
5 in poorly performing local minima, the feed- forward neural 
network is able to approximate the prior probabilities thanks to 
Bayes< rule. Indeed, p(y/x±) - p(x±/y) x p(y) / p(x±). The prior 

probability distribution of classes is context-dependent but can 
be estimated by counting the classes occurence of classes in the 
10 learning set, or by introducting prior knowledge. The prior 
probability of the observation vector can be discarded as for 
each time step; it is independent of the phone class. 

The HMM and observation sequence finally provide, thanks to the 
15 Viterbi algorithm, the most likely state sequence which caused 
the observed acoustic data sequence. 
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SUMMARY OF THE INVENTION 

The starting point of this invention consists of enhancing the 
neural networks algorithms to make their predictions more 
accurate and robust in oilfield applications. 
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In a first aspect, the invention concerns a system for inferring 
geological classes from oilfield well input data comprising a 
neural network for inferring class probabilities, characterized 
5 in that : said system further comprises means for integrating 
class sequencing knowledge and optimising said class 
probabilities according to said sequencing knowledge. 

Preferably, the means for integrating class sequencing knowledge 
10 and optimising said class probabilities according to said 
sequencing knowledge comprises a hidden Markov model. 

In a second aspect, the invention concerns a method for 
inferring geological classes from oilfiled well input data, 
15 comprising the following steps: - inferring class probabilities 
with a neural network; and - integrating class sequencing 
knowledge and optimising said class probabilities according to 
said sequencing knowledge. 

20 Preferably, integrating class sequencing knowledge and 
optimising said class probabilities according to said sequencing 
knowledge is achieved according to a hidden Markov model. 

Advantageously, the invention relates to a system and method for 
25 inferring geological classes from single-channel oilfield input 
data by applying hybrid neural network hidden Markov models 
classifiers. 

The geological classification is inferred using supervised 
30 neural networks that are applied to the input data and that 
predict the associated classes. The vertical class transition 
constraints are learned within a Markov class transition table 
and a prior class distribution, which are then reused during the 
estimation of the classes. This optimizes the predicted class 
35 curve and honours geological prior knowledge. 



- 8 - 



WO 2004/066060 PCT/GB2004/000304 

This invention relates the enhancements of artificial neural 
network (ANN) reservoir characterization techniques for 
geological classification. Supervised neural network classifiers 
can be applied to downhole logs to automatically predict 
5 lithology or other classes in boreholes. However, ANN systems 
sometimes infer geologically incorrect vertical ( stratigraphic) 
class transitions within a borehole. The root cause of these 
errors is the fact the networks analyze and predict the output 
classes sample-by- sample, without taking the whole borehole 

10 sequence of classes into account. Improving the prediction of 
lithofacies from downhole logs is solved by the system and 
method of the present invention. In essence, they do not take 
into account local information that is commonly important in 
stratigraphic rock sequences. Geological transitions are 

15 commonly not random, but predictably sequenced. 

The system integrates an a priori knowledge of class sequencing 
and of class probability distribution in the neural network 
predictor. It combines a supervised back-propagation, feed- 

2 0 forward neural network architecture with a Hidden Markov Model 
module into a complex hybrid neural processing chain. The second 
processing step optimizes the class * stratigraphic sequence. 
Instead of simply choosing, for each set of input data the class 
that is the most probable, the chosen class is the one which has 

25 both a reasonable occurrence probability given the input data 
pattern and a reasonable occurrence probability given the 
previous estimated class. Such a choice governed both by class 
transition and a posterior class observation probability is 
implemented through the Viterbi algorithm. 

These and other features of the invention, preferred embodiments 
and variants thereof, possible applications and advantages will 
become appreciated and understood by those skilled in the art 
from the following detailed description and drawings. 

35 

DRAWINGS 
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Figure 1 is a block-diagram of the training of the hybrid 
ANN-HMM classification system. The training set consists of 
INPUT DATA across several wells and associated core information. 
5 The normalization of INPUT DATA and the generation of additional • 
attritiites is optional (dotted arrows) . The construction of the 
HMM during the training phase is optional as well, if it is not 
essential to compute it for the training data set. 

Figure 2 is a block-diagram of the estimation of the hybrid 
10 ANN-HMM lithofacies classification system on uncored boreholes 
by applying the system to well logs. 1 r he normalization of. INPUT 
DATA and the generation of additional attritutes- is optional, 
(dotted arrows) . As for the HMM, one can load an existing HMM 
from the data storage system and / or manually define it on' the 
15 basis of the geological prior knowledge. 

Figure 3 is a block diagram of the Hybrid ANN-HMM 
processing chain in the geological classification estimation 
mode. The supervised, neural network module aims to predict the 
posterior class probabilities given an observation. The Viterbi 
20 processing optimizes the predicted class path. 

Figure 4 depicts the same process as in figure 3, but with 
a state- to-state HMM instead of a sample-by- sample one. 

Figure 5a shows a particular ANN architecture where the 
neural network integrates a Kalman- trained matrix K. 

25 Figure 5b illustrates the concept of neural network expert 

committee . 

MODE(S) FOR CARRYING OUT THE INVENTION 

30 The Hybrid ANN / HMM of an example of the invention is composed 
of two different components, which are the ANN posterior CLASS 
probability estimator, and the HMM, comprising only a CLASS 
transition table and a CLASS probability distribution. In the 
present example those components are trained separately during 
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the training phase of the system, as they do not need to 
interfere with one another during the learning step. They are 
also applied separately during the estimation step. 

5 In the following single steps and components of the example of 
the invention are described in greater details making reference 
to figure 1 to 5 . 

1 . Data choice and input 

10 Processing of the INPUT DATA is. done on a sample-by- sample 

basis, and therefore the CLASS probabilities are estimated for 
each sample. 

1.1. Borehole choice (see step 1.1 to 1.3 on figure 1 and 
15 2.2 on figure 2) 

Both the learning and the estimation of the Hybrid HMM / ANN 
classification system can be done on several wells, as long as 
they share the same geological INPUT DATA and properties. This 
2 0 system is by consequence designed to propagate the knowledge of 
the physical and statistical relationships between INPUT DATA 
and CLASSES, as measured in one or several wells, to the whole 
set of boreholes within an oilfield. 

25 If one or more INPUT DATA curves are missing, they can be 
estimated based on available data. One can for instance 
integrate synthetic, logs so as to perform rock class estimation. 

1.2. Input data choice (see step 1.1 on figure 1 and 2.1 on 
30 figure 2) 
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The following section log curves are used as INPUT DATA. For the 
purposes of testing and validating the method outlined here, the 
set of logs used includes gamma ray (GR) , sonic slowness (DT) , 
thermal neutron porosity (NPHI) , and bulk density (RHOB) , all 
5 measured at same depth for each sample, and at a constant 
sampling distance. A regression is often done on those data by 
taking several samples above and several samples below the 
depth of interest. 

10 However, any set of ID, 2D or 3D geological or geophysical 
information is suitable for this system. ID depth- oriented 
attributes are extracted from the existing information. 
Alternatively sliding window statistics can be extracted at the 
neighborhood of the depth of interest, and this can be done for 

15 instance on 2D open borehole microresistivity images or 3D 
seismic cubes . 

1.3. Training data set and cross-validation data set (see 
step 1.3 on figure 1) 

20 

The learning data includes both INPUT DATA and. core or 
geologist-defined corresponding CLASSES zonation. This CLASSES 
zonation of the INPUT DATA is considered as the desired goal 
which has to be attained by the Hybrid HMM / ANN classification 
25 system. 

The supervised training of the ANN component is done for each 
sample of the INPUT DATA, and for as many epochs as necessary, 
until a global mean square error between the desired outputs and 
3 0 the actual outputs is satisfactory. A second error, the so- 
called cross-validation error, is also computed on a different 
data set, not taken into account when training the ANN. This 
monitors the generalizing abilities of the ANN, preventing the 
ANN from learning the training data set "by heart". Usually, the 
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training stops when the cross-validation error starts to 
increase. 

The separation between training set and cross-validation set is 
5 done on the basis of random choice. The total percentage of data 
being selected for. the cross-validation set is chosen during the 
ANN architecture choice step, for instance, p = 50%. Then, for 
each sample of the INPUT DATA set, that sample is randomly 
attributed to the training set or the cross-validation set 
10 according to the probability p. 

1.4. Additional attributes generation (see step 1.4 on 
figure 1 and 2.3 on figure 2) 

15 Under certain circumstance, for example if the INPUT DATA does 
not contain, on a localized sample-by-sample basis, enough 
spatial information which could help to discriminate among 
CLASSES, additional information may be extracted from existing 
data, e.g. seismic data or logs. That information can for 

20 instance, show the evolution of the INPUT DATA curves, the 
energy contained in the curves or the smoothed low- frequency 
component of the curves . In order to obtain such information one 
can apply a set of gradients to the INPUT DATA curves, or 
extract low frequencies using for example the Fast Fourier 

25 Transform, or approximate the INPUT DATA curves with Polynomial 
curves on small windows, i.e. a subset of the data. 

This additional attributes generation is done both for the 
learning data set and for the estimation data set. 

30 

1.5. Log data normalization (see step 1.5 on figure 1 and 
. 2.4 on figure 2) 
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The generalization performance of the ANN can be enhanced using 
a pre-processing step consisting of normalizing of the data. 
This optional step can consist of one or more of the following 
computations : Mean - Standard Deviation Normalization, 
5 Principal Component Analysis (with retention of the principal 
components which account for 95 or 98 % of the data) , Mininum - 
Maximum Normalization. or other known methods for data 
normalization. 

10 2.. Neural network component 

Several architectures, training algorithms, and methods of 
implementation are possible for the neural network. In the 
present example the component is a feed- forward MLP (Multi-Layer 

15 Perceptron) , with an input layer (one neuron per log data 
attribute) , one or several hidden layers, and an output layer 
(one neuron per CLASS) . The outputs O = (o±, o 2 , ... o N ) of the ANN 
have to be the probabilities of each CLASS, according to the 
current INPUT DATA, and as a consequence have to equal 1 and 

20 each belong to the interval [0, 1] . 

2.1. Choice of the neural network architecture (see step 
1.6 on figure 1 and figures 5a, 5b) 

25 A preferred embodiment of this method is the three-layered 
neural network, with: sigmoid activation functions; bias; as 
many nodes on the first layer as there are log attribute inputs, 
for instance 4, then 10 nodes on' the first hidden layer and 10 
nodes oh the second hidden layer. The number of nodes on the 

30 output layer is the same the number of CLASSES. 

As illustrated in figure 5A, an additional linear matrix K can 
be added to the ANN after the output layer of the neural 
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network. In this case, the last neural layer does not need to 
have as many nodes as there are CLASSES , but the linear matrix K 
has to be correctly sized and performing the following operation 
: X = K Xh, where X is a vector of size N {N being the number of 
5 CLASSES), Xh is a vector of size Nh coming out from the last 
neural layer, and K is a matrix of size NxNh. 

Independent of which ANN architecture is retained, several ANN 
modules can be combined into a neural network "expert committee" 
10 as shown on figure 5b, step 5b. 2. 

After selection of the ANN architecture the neural networks are 
trained'. 

2.2. Training the ANN (see step 1.7 to 1.10 on figure 1) 

15 2.2.1. Evaluation of the network performances at each 

step (see step 1.8 on figure 1) 

Evaluation is realized by computing the global RMSE (Residual 
Mean Square Error) between the desired outputs as provided by 
20 the training or the cross-validation data set, and the actual 
outputs of the ANN. Two curves corresponding to that training 
and cross-validation error are displayed and monitored during 
the training process . 

25 2.2.2. Error Back- Propagation (see step 1.7 on figure 1) 

The supervised training of the neural network is performed by 
the Error Back-Propagation, and the algorithm used can be, for 
instance, Gradient Descent with Adaptative Learning Rate and 
30 Momentum. Thi's means that the difference between the expected 
CLASS probability as provided by the training data set, and the 
actual current output of the ANN, is propagated backwards 
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through the ANN and the neural weights are accordingly updated. 
The Adaptative Learning Rate means that this correction is 
proportional to a learning rate which is tuned accordingly to* 
the evolution of the global RMSE. The Momentum means that a term 
5 corresponding ■ to the total sum of the neural weights of the 
network is added to that global error, with the aim of avoiding 
the values of those weights increasing too much. 

2.2.3. ANN expert committees (see figure 5b) 

10 

Instead of one ANN, one can run several ANN and the average of 
their outputs (see step 5b. 3) can be taken as CLASS probability. 
The training of each ANN module of that committee is done using 
to a bootstrap, procedure (see step 5b. 1) , which includes the 
15 steps of slightly altering the training set for each ANN 
(different partition of INPUT DATA samples between- the training 
set and the cross-validation set) , and randomly initializing the 
neural weights of the ANN before training. The generalization 
abilities of the ANN are then enhanced. 

20 - 

2.2.4. Training of the K matrix by Kalman filtering (see 
figure 5a) 

In cases described above where a linear matric is used to 
25 associated the last layer output with the CLASS output, this K 
matrix is trained in addition to the neural network. 

This K matrix is trained in the following way: - At each epoch 
of the training, a first run of the ANN through all the INPUT 
3 0 DATA samples is realized. 
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As a result a matrix Mh is computed: Each row Xh of that 
matrix corresponds to the outputs of the last layer of the ANN, 
for a given INPUT DATA sample. 

5 - The training data set is a matrix Mt where each row 
corresponds to the CLASS probabilities Xt for a given INPUT DATA 
sample 

The matrix X is approximated by a Kalman-Filtering 
technique so as to minimize the RMSE of E = Xt .— K Xh where E 
10 and Xt are vectors of size N (N being the number of CLASSES) . 

Once the matrix K is approximated, the Back- Propagation is 
applied to ' the ANN for all the training data set samples, and 
the error is propagated through K first before being propagated 
15 through the network. 

After these steps the training is completed. 

2.2.5. Termination of the training (see step 1.9 and 
2 0 1.10 on figure 1) 

The end-user sets the number of training epochs for the NN. The 
• training might stop earlier if the cross-validation error has 
begun to increase. If the final performances of the ANN are not 
25 satisfactory, the end-user can tune the ANN parameters and try a 
different configuration. One can also envisage a system where 
several configurations of the ANN are successively automatically 
tried and the best one retained. 

30 A further component of the system is a Hidden Markov Model. A 
HMM is used to optimize the class stratigraphic sequence by 
chodsing, for each set of input data a class that is the most 
probable and which has both a reasonable occurrence probability 
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, i 
given the input data pattern and a reasonable occurrence 

probability given the previous estimated class. 
3 . The Hidden Markov Model component , 

5 

3.1. Training of the HMM 

CLASS and CLASS transistion is preferably based on prior 
geological knowledge derived for example from core evaluation or 
evaluation of log data by experts. The knowledge is split into a 
10 CLASS probabilty distribution and CLASS transistion 
probabilities . 

3.1.1. Automatic training on the cored / geologist 
defined lithofacies data (see step 1.11 on figure 1) 

15 The CLASS transition probability table depends only on the 
CLASSES of the learning set and it is therefore an absolute and 
static reference. It is computed by counting the successive 
CLASS transitions. It cannot be influenced by the neural 
predictions, and for a proper application of the' Viterbi 

2 0 algorithm, it should be learnt on a large training set of facies 
log curves. It is possible to learn the CLASS transition table 
on a set of multiple wells, and in this case the CLASS 
transitions between two different wells are obviously not taken 
into account in the computation of the CLASS transition table. 

25 ' . 

These CLASS transitions can be counted on a sample -by- sample 
basis, (i.e. for each INPUT DATA sample), or on a state-to-state 
basis, grouping all the samples from the same CLASS together 
(see figure 4) . 

30 

A similar automatic computation is done to approximate the CLASS 
probability distribution. 
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3.1.2. Geologist-driven ^correction of the HMM model (see 
step 1.12 on figure 1,. 2.5 on figure 2, and 3.1 on 
figure 3) 

This correction is performed based on the geologist's expert 
prior knowledge and can be done after the automatic estimation 
of the HMM on the learning data set, or before applying the HMM 
to a specific estimation data set of borehole logs. It relies on 
the CLASS transition probabilities, (and) the CLASS probability 
distribution. In case of rock facies classification, it can also 
rely on the lithofacies bed thickness which the geologist has 
defined in his geological study. 

After the learning or training epoches probabilities' derived 
from the HMM are combined with the neural network to generate a 
CLASS estimate. In the present example Bayesian statistics is 
1 used to estimate the CLASS based on availiable knowledge. 

3.2. Combining HMM and ANN 

3.2.1. Applying Bayes' Rule (see step 3.2 to 3.4 on 
figure 3 and 2.6 to 2.7 on figure 2) 

The ANN posterior CLASS probabilities estimator is designed to 
work independently from the observation set. Actually, this is a 
plain classifier, providing the probability p(x±) for each 
INPUT DATA sample. However, this can also be expressed as the 
posterior CLASS probabilities for each input data pattern given 
the current observation, p(x±jy). 
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A HMM model makes use of three different elements, which are: 
the state transition probabilities matrix, the state probability 
distribution, and the observation probability matrix given the 
current state p(yjx±) . Those three elements are also required 
5 for the Viterbi algorithm described below. 

In order to get p(y\x±) , one needs to apply the Bayes Rule and 
to introduce the observation (INPUT DATA) probability 
distribution and the state (CLASS) probability distribution. 
10 However, as the INPUT DATA observations are continuous, and as 
for each depth step of the Viterbi algorithm that observation 
probability remains the same for all the possible CLASSES, one 
can just discard the p(y) value. 

15 3.2.2. Applying the Viterbi optimization algorithm (see 

step 3.5 on figure 3 and 2.8 on figure 2) 

At this stage, the hybrid ANN / HMM classification system has 
both a prior CLASS distribution, a CLASS transition 

20 probabilities matrix, and the posterior CLASS probabilities 
matrix, which depend on the time patterns and the observations . 
The Viterbi algorithm can be applied to those data, provided 
that the application of Bayes' rule will transform the posterior 
CLASS probabilities into prior observation probabilities given 

25 the previous CLASS and the time pattern. 

It can be seen that there is no need to compute the markovian 
matrix of the observation probabilities given the CLASS. The 
Viterbi algorithm can therefore directly integrate the time- 
30 dependent observation probabilities given the current CLASS and 
the current time pattern. In other words, the state transition 
matrix and the state probability distribution have a static 
behavior (although they can be tuned to the context of the 
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estimation) whereas the observation probabilities, given the 
previous CLASS, are depth ^dependent . 

3.2.3. .State-to-state or sample-by- sample classes 
transitions (see figure 4) 

In case of sample-by-sample classes transitions, the Viterbi 
algorithm is applied to all the INPUT DATA and associated 
estimated CLASSES probabilities samples. 

In case of state- to-state transitions, all the consecutive 
samples which have in common the same most probable CLASS (see 
step 4.1), are grouped together (see step 4.2) and considered as 
an unique observation element. The CLASS probabilities of all 
the samples belonging to that element are also averaged (see 
step 4.3), and the Viterbi algorithm is then applied to the 
groups of observations and not to each observation sample. 

The mean of computing the CLASS probabilities of the observation 
element can be either a plain mathematical average or a more 
complex average. 

In order to. perform a state-to-state Viterbi optimization, the 
state transition probabilities and the state distribution have 
also to be computed on a state-to-state basis and not on a 
sample-by-sample basis (see 3.1.1 and step 4.4 on figure 4). 

After the Viterbi optimization, the observation elements are 
split into INPUT DATA samples again, and the CLASS curve is 
displayed on a sample-by-sample basis (see step 4,5 on figure 
4) . 
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4 . Testing results 

The simple ANN and the Hybrid ANN / HMM classifiers have been 
trained and tested on three different sets of geological data. 
5 'As input data downhole logs were used, and as classification 
results, the rock facies classes. 

4.1. Data set 1 (cored logs) 

10 The first data set used for the training contained 490 samples 
of 4 logs (DT, GR, NPHI and RHOB) and associated core facies (7 
facies classes) r and were taken from real measurements performed 
in a well between the depth of 297 5 - 3 051 m. 

15 Once trained, the hybrid system has been tested on the 
measurements from the same well between depths 2923 - 3161 m, 
which corresponded to 1562 samples from the same • logs . 

Whereas a single ANN system showed an amount of approximately 40 
20 to 45 % correct predictions, the hybrid ANN / HMM system reached 
an accuracy of 45 to 55 %. 

4.2. Data set 2 (cored logs) 

25 The data set used for the training contained 3800 samples • of 5 
logs (DT, GR, NPHI, RT and RHOB) and associated core facies (13 
facies classes) , and were taken from real measurements performed 
in 4 wells between the depth of 8000 - 9000 feet. 

3 0 Once trained, the hybrid system has been tested on the 
measurements from 4 other wells of the same field, where core 
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data were available. The accuracy of the results has been- 
significantly increased. 

4.3. Data set 3 (non-cored logs) 

In this example, the training data set contained about 2000 
samples from one well. The testing data set contained between 
1500 and 2 000 samples per well in a field of 5 wells. 

The lithofacies learning set for the Hybrid ANN / HMM system was 
provided by the results, of electrof acies predictions of an 
unsupervised neural network classifier. The stability of the 
predictions of the latter systems has then been compared with 
the predictions of a plain ANN trained on the same lithofacies 
log curve . 

The Hybrid ANN / HMM system is' more reliable than a single ANN 
system in the terms of prediction accuracy and gave less noisy 
results; it will therefore provide better geological lithofacies 
log curve estimations. 
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